Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement

Authors

Qingju Liu

Wenwu Wang

Philip J. B. Jackson

Abstract

We present a novel method for extracting target speech from auditory mixtures using bimodal coherence. The coherence is statistically characterised by a Gaussian mixture model (GMM) in an offline training process, using robust features obtained from audio-visual speech. We then use the bimodal coherence to adjust the ICA-separated spectral components in the time-frequency domain, mitigating the scale ambiguities across frequency bins. We tested the algorithm on the XM2VTS database, and the results show that it improves performance in terms of signal-to-interference ratio (SIR).
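The scale ambiguity mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' algorithm: the abstract does not give the coherence model or the adjustment rule, so the per-bin least-squares gain, the function names `rescale_bins` and `demo`, and the use of the clean target magnitudes as the reference are all assumptions made for illustration. In the paper the reference information would come from the audio-visual coherence model rather than from the clean signal.

```python
import random


def rescale_bins(separated, reference):
    """Estimate one gain per frequency bin by least squares.

    separated[f][t]: magnitude of a separated component in bin f, frame t,
                     known only up to an arbitrary per-bin scale.
    reference[f][t]: magnitude the target is expected to have (hypothetical
                     stand-in for what a bimodal coherence model would supply).
    Returns (gains, rescaled) where rescaled[f][t] = gains[f] * separated[f][t].
    """
    gains, rescaled = [], []
    for sep_f, ref_f in zip(separated, reference):
        num = sum(s * r for s, r in zip(sep_f, ref_f))
        den = sum(s * s for s in sep_f)
        g = num / den if den > 0 else 1.0  # leave an all-zero bin unchanged
        gains.append(g)
        rescaled.append([g * s for s in sep_f])
    return gains, rescaled


def demo(seed=0, n_bins=4, n_frames=50):
    """Build a toy target, apply unknown per-bin scales, then undo them."""
    rng = random.Random(seed)
    target = [[rng.uniform(0.1, 1.0) for _ in range(n_frames)]
              for _ in range(n_bins)]
    # Frequency-domain ICA leaves each bin with its own arbitrary scale.
    unknown_scales = [rng.uniform(0.2, 5.0) for _ in range(n_bins)]
    separated = [[a * x for x in row]
                 for a, row in zip(unknown_scales, target)]
    gains, rescaled = rescale_bins(separated, target)
    return unknown_scales, gains, target, rescaled


if __name__ == "__main__":
    scales, gains, _, _ = demo()
    for a, g in zip(scales, gains):
        print(f"unknown scale {a:.3f}  estimated gain {g:.3f}  product {a * g:.3f}")
```

With an exact reference, each estimated gain is the reciprocal of the unknown scale in its bin, so the rescaled spectrum is consistent across frequency. With a noisy or model-based reference the gains would only approximate that reciprocal.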

Similar resources

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

The quality of a speech signal degrades significantly in the presence of environmental noise, which impairs the performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, single-channel enhancement of speech corrupted by additive noise is considered. A dictionary-based algorithm is proposed to train the speech...

The Relationship between Iranian EFL Learners’ Ambiguity Tolerance and the Accuracy of Their Task-based Oral Speech

Various individual differences, including ambiguity tolerance (AT), have gained momentum because of the influence they can exert on the process and product of learning, and thereby, on various aspects of the learner’s interlanguage system such as accuracy of oral speech. The present study was undertaken to examine the extent to which Iranian EFL learners’ AT was significantly correlated with th...

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Adaptation control of beamforming interference cancellation techniques is investigated for in-car speech acquisition. Two efficient adaptation control methods are proposed that avoid target cancellation. The “implicit” method varies the step-size continuously, based on the filtered output signal. The “explicit” method decides in a binary manner whether to adapt or not, based on a novel estimate...

Single-Microphone Speech Enhancement Inspired by Auditory System

Dissertation by Majid Mirbagheri, Doctor of Philosophy, 2014; directed by Professor Shihab Shamma, Department of Electrical and Computer ... Enhancing quality of speech in noisy environments has been an active area of research due to the abundance of applications dealing with human voice and dependence of their per...

A new metric for selecting sub-band processing in adaptive speech enhancement systems

A multi-microphone adaptive speech enhancement system employing diverse sub-band processing is presented. A new robust metric is developed, which is capable of real-time implementation, in order to automatically select the best form of processing within each sub-band. It is based on an adaptively estimated inter-channel Magnitude Squared Coherence (MSC) relationship, which is used to detect the...

Publication date: 2010
